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1. Introduction 


In a previous paper [Uhlhorn, 1960 I]! some aspects of the phenomenological 
linear theory of irreversible processes were discussed. The present work is con- 
cerned with the probabilistic interpretation of the phenomenological theory and 
its relation to statistical mechanics. 

In the first sections of the paper a mathematical framework suitable for the 
statistical description of irreversible processes is presented. This framework is a 
somewhat extended version of the Hilbert space theory of stationary stochastic 
processes. Its formal structure is similar to that of quantum mechanics, and 
this analogy turns out to be fruitful (sections 5, 8, Appendix 2). The classi- 
fication of a stationary stochastic process as deterministic or non deterministic— 
these concepts were originally introduced by Wold [Wold, 1938]—corresponds 
closely to the classification of a physical process as reversible or irreversible 
(sections 8, 10, 13). A non deterministic stochastic process, which corresponds 
to an irreversible physical process, can always be considered as generated by a 
“purely random” mechanism. The connection between the stochastic process 
and its generating mechanism is expressed in the form of a “causal representa- 
tion’ of the stochastic process (sections 9, 10, 13). In sections 11 and 12 the 
application of the linear theory of stochastic processes to statistical mechanics 
is discussed, and section 13 is devoted to the probabilistic interpretation of the 
linear phenomenological theory of irreversible processes. 


2. Some basic probabilistic concepts 


In this section a probabilistic framework adapted for the treatment of ir- 
reversible processes is presented. We shall be concerned with the so called 
linear theory of stochastic processes? in a slightly modified and extended form. 

The basic concept of the theory of probability is that of an “event” and 
its assigned “probability”. Conventionally the events are represented as certain 
subsets of a basic “phase space” @. It is assumed that the complement of an 


1 Hereafter referred to as I. ; 
2 See in particular [Karhunen, 1947] and [Grenander and Szegé, 1958]. References to later 


publications on the subject can be found in [Jaglom, 1959]. 
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event is an event, that the empty set ¢ is an event, and that the logical sum 
of enumerably many events is an event. The totality of all events F is then 
said to be a sigma-field. In general a point in Q is not an event. To every 
event F there is assigned a probability P(F) that is a non negative number 
in such a way that P(Q)=1 and that the probability of the logical sum of 
denumerably many incompatible events is equal to the sum of the probabilities 
of the events. P(.) is then said to be a sigma-additive, normalized measure on 
(Q, J), and the structure (Q, F, P(.)) is said to be a probability space. 

A random variable «=2(.) is a real or complex function on Q, such that 
if J is an interval of the real axis or of the complex plane the subset # ‘J of 
Q, consisting of all points @ for which the value 2(w) is contained in J, is an 
event. Hence the probability distribution of « on the real axis or on the com- 
plex plane is well defined by P,(I)=P(x *J). 

The expectation or mean value of a random variable a is defined as the 
integral 


Ex=| x(w) P(dw)= | 2’ P,(d2') if Elz] <0. 


Q 


If Elz|?< co the random variable « is said to have finite variance. The to- 
tality of all complex random variables «x, y, .... with finite variance form a 
complex Hilbert space L?(Q, F, P(.)) with the scalar product 


<aly>=[x(w)y(m)P (do)! (1) 
Q 


and the norm 


Ile] =Lf |e (@) PP (do), (2) 
Q 


If Ex=Ey=0 the scalar product (1) is called the covariance of the random 
variables « and y. We note that e(m)=1 defines a random variable e with 
finite variance, ||e||=1. The expectation of an arbitrary random variable x with 
finite variance can thus be written 


Ex=e<elar>, (3) 


and it is seen that E may be considered as a projection on L?(Q, F, P(.)) 
onto the subspace of constant random variables spanned by the unit vector e. 

For the purposes of statistical mechanics of closed classical systems it is 
always possible to take the basic space Q as the phase space of the canonical 
coordinates. All intervals of the phase space are considered as events. In the 
statistical mechanics of macroscopic systems every physically relevant proba- 
bility distribution is absolutely continuous with respect to the “Liouville meas- 
ure” const. dp, ... dpndq,...dqn, that is it is characterized by a density 
f(Pr> -++; Pn» > +++, Qn). If the probability assigned to an event is zero the 
event is considered as unobservable. Events consisting of a single phase point 
are thus always unobservable. For quantum mechanical systems a correspond- 


* The bar denotes complex conjugation. 


274 


ARKIV FOR FYSIK. Bd 17 nr 16 


ing phase space can be constructed relative to a particular series of observa- 
tions [Uhlhorn, 1960 b]. 

The statistical description of open systems can in general not be given in 
terms of a probability distribution on the phase space only. If for instance we 
consider a system the dynamics of which is determined by Hamilton’s equa- 
tions combined with a random force acting at the boundary we will in general 
need an infinite dimensional basic space © to give a complete probabilistic 
characterization of the system and the changes taking place in it in the course 
of time even though the number of canonical coordinates describing the system 
at an arbitrary instant is finite. 


3. Stochastic processes with finite variance 


A stochastic process is a family of random variables. We shall consider fa- 
milies of random variables with finite variance exclusively. It is convenient to 
characterize a stochastic process as a random function: Let J be an arbitrary 
set and associate with each point t in J a random variable x(t). J is called 
the parameter set of the stochastic process x(.). For every fixed point w in 
Q),x2(., m) is a function of 1, a realization of the stochastic process. 

The function r(t,, tT.) defined by 


(Ty, Te) = (te, T%) = <2 (14) | % (72) (4) 

is called the covariance function of the stochastic process {x(t)|7 ET}. 
Let G,(J) be the set of all complex functions g(.), p(.), ... on the para- 
meter space J which are zero in all but a finite number of points. Evidently 


G,(J) is a complex vector space. Let a linear transformation from G)(J) into 


LP? (Q, F,P(.)) be defined by 


z[p]= > x(t) H(z). (5) 


TET 


We have obtained a new stochastic process x[.] with the parameter set G(T) 
which in contrast to the original process #(.) is a linear function of its ar- 
gument. Evidently the two processes 2(.) and t[.J determine each other 
uniquely. The covariance function corresponding to (5) is 


Rig, v) =<elpllely) = 2S rt 7) ) p(t) p (Te), (6) 


and is a Hermitian non negative functional on the vector space G(T) 
R(Q, y+ y2)=R(y, )+R(Q, Wo); R (Q, cy)=cR(Q, y) 
Rip, y) = Rly, 9) (7) 


R(p, v) 20." 


1 The third property evidently implies the second one as the vector space G, (J) is complex. 
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The range of a[.], that is the totality of random variables «|@] with @ in 
G, (J) is a linear space contained in LP? (Q, F,P(.)). The closure! of this range 
H (J) is called the Hilbert space of the stochastic process 2(.). 

Starting from any functional R(y~, y) with the properties (7) one can easily 
construct a Hilbert space G(J) which is isometrically equivalent to the space 
H(J), but is entirely “non probabilistic’. To see this we define for fixed y 
an antilinear functional f, on G(T) by 


fylg]=R@ ¥)- (8) 
The totality of functionals of the type (8) form a linear complex space 
fi eolet les fep=C]y> (9) 
and 
f,=0 if and only if R(y, y)=90. (10) 


The last proposition follows from the fact that 
[Rip y)PSR(g, @) Ry Y): (11) 
Hence a scalar product can be uniquely defined by 


olf =R y) llfll=R@ o)?. (12) 


and closing the space of antilinear functionals on G)(J) with respect to the 
norm (12) we obtain a Hilbert space G(J). Hence it is always possible to con- 
struct a Hilbert space corresponding to a prescribed non negative functional of 
the type (7). Evidently there is a one to one correspondence between the ran- 
dom variable x[p] and the functional f, and ||x[@]||=||f,||. Consequently the 
Hilbert spaces H(J) and G(J) are isometrically equivalent. 


4, Linear conditioning 


Let us assume that J, is a subset of J and that the process 2(.) has been 
observed for all values of t in J,. This means that we have obtained a par- 
ticular realization {x(t, @))|t€T,} of the stochastic process. This knowledge 
given, we want to estimate the value of the process for an arbitrary value of 
the parameter t. Thus we need some sort of conditional expectation of a(t) 
given the realization {a#(t,)|t,€I,}. Such conditional expectations can be con- 
structed in various ways. We shall use linear conditioning, that is we shall 
choose an estimate a(t, J,) of x(t) which is a linear combination of random 
variables a(t,) with parameter values in J, or a limit of such combinations. 
This means that we take «(t, J,) to be an element in H(J,). The estimation 
error is «(t)—2a(t, J,), and we want to determine w(t, J,) so that the variance 
of the estimation error ||«(t)— (zt, J,)||? is as small as possible. It is geomettri- 
cally evident that x(t, J,) has to be taken as the orthogonal projection of 
Biz) inte 4 (J;) 


a(t, J;)=E" x(t) (HE is the projection onto H(J,)). (13) 


' The closure is taken with respect to the norm (2); for the definition see e.g. [Halmos, 1951]. 
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It can evidently happen that 
Bec) = a(t) (14) 
even if t is not contained in J,. Then «(z) is said to be determined by the 
process in J,. 
It is obvious that the solution (13) of the estimation problem is purely for- 
mal. To obtain the real solution one needs an analytical characterization of 
the Hilbert subspace H (J,). 


5. Operations on stochastic processes 


Let a be a point transformation on the parameter space J of the stochastic 
process 2(.), that is for every t in J, at is a point in J. We define the 
linear transformation D[a] corresponding to a in the linear space G,(J) by 


Dlalp(t)= > y(t’). (15) 


It follows directly from the definition (15) that if a, and a, are two point 
transformations in J we have 


D [a,]D [a,] = D [ay 4,). (16) 


We shall further assume that the transformation a is bounded in the fol- 
lowing sense: 


R(D[a]¢, Dla]y) => > 71 (ary, 472) (41) Y(t.) < M2 R(Q, @) (17) 


T. 


2 


for every element m in G,(J) and some non negative constant M,. It is a 
direct consequence of (17) that 


R(y~, p)=0 implies R (Daly, Dlaly) =9. (18) 


Having obtained (18) we may without ambiguity define a linear operation 
D{a] corresponding to a in H(J). We begin by defining D{a] for elements of 


the type x«[q]: 
D{a)x(y)=2[D[a]¢]. (19) 


The definition (19) is unique for if x«[y]=a[y] we have according to (18) 


«[D[a]p|=«[D [a] y]. 
It follows from (17) that 
|| Dla] x{¢]||< Mell #[¢]|l- (20) 
Hence D[a] may be extended to the complete Hilbert space H(J), and 
|| D[a]||< Ma. (21) 
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' (t=) 
ra n Ta Ua ) '\ = de t 
It follows from (19) by taking g(t’) Oie tha 
D {ala (t)=x (ar). (22) 


If a is a point transformation in J satisfying the condition (17) and if a 
is a point transformation in J related to a by the condition on the covariance 
function 


(Tj, @%p) =7 (a*t, T,) (arbitrary t,, T,€), (23) 
then D[a*] exists and is equal to the Hermitian adjoint of D[a]: 
D{a*|=D{[a}*. (24) 
To see this we observe that (23) implies 
R(y, Dlaly)=R(D[a*] q, yp) (arbitrary —, yp € G(T). (25) 
Applying Schwarz’ inequality we find 


R(D[a*]y, D[a*]p)=R(y, Dla] D[a*] ¢) < 
<R(g, v)"R(D[a]D [a*] , Dla] D [a*] 9)" < (26) 
<R(y, y)" Ma R(D[a*] gp, Dla*] yp)", 


and hence 
R(D[a*]~, Dla*]y)<M2R(Q, ¢). (27) 
It follows that D[a*] is well defined and that 
<D[a*)a[¢]|«lpl> =<x[y]|Dle*]*|2 yh =<xlg]|Dia]|alyh, (28) 
and by continuity D[a*|]*=D[a] as was to be proved. 
If a, and a, are two point transformations in J satisfying the boundedness 


condition (17) it follows from (16) that 


D{a,| D{a,|)=D{[a,a,). (29) 


6. Symmetries of a stochastic process 


A point transformation a on the parameter space of the stochastic process 
w(.) will be called a symmetry transformation of the stochastic process if 


a. L(O%y, bt) —1( ty, ve) (anbitwary ty, 756 DT) 
(30) 


b. a@ has an inverse a“', satisfying a tat=aa't=t (arbitrary tE7). 
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From (30) a. it is seen that the boundedness condition (17) is satisfied with 
M,=1, and we find 


D{a]* D[a]=1, (31) 


that is D[a] is an isometry of the Hilbert space H(J). From (30) b. we find 
that im (23) we may take a*=a™' and using (29) we finally obtain 


D[a}-1 =D [a]*=D[a-], (32) 


that is, a symmetry operation of the stochastic process gives rise to a unitary 
transformation in the Hilbert space of the process. 

The symmetries of a stochastic process, which evidently form a group, may 
be said to correspond to the integrals of motion of a dynamical system or 
rather to the transformations generated by the integrals of motion. The explicit 
representation of the stochastic process as a vector in the Hilbert space H (J) 
is greatly simplified if one uses a “‘coordinate system’ in which as many of 
the symmetry transformations as possible are “diagonal”. We see that there 
is a certain formal analogy between stochastic processes with symmetry opera- 
tions and quantum mechanical systems with integrals of motion. 

Let a be a symmetry operation of the stochastic process a(.) and [](J,) 
the orthogonal projection onto the subspace H(J,) corresponding to the subset 
J, of the parameter set J. Then 


D{a]T1(J,) Dla}*=[1(aJ,) or 


(33) 
D [a] I] (T,) _ II (a Ji) D [a]. 


To obtain the relation (33) we note that D[{a][](T,) D[a]* is a projection, 
D{a] being unitary. Take x€H(aJ,) that is [l(aJ,)e=2. « is the limit of 
elements of the form z[y], y(.)€G@)(a7,). It follows that 


D{al{](F,) Dla\* zy] =2[y], 


and hence D[a][|(J,) D[a]*x=«. Conversely if D[{a][](J,) Dla|*¥a=x we have 
I1(J,) D{a]*«=D[a]*x and hence [](aI,)x=~. It follows that the projections 
are equal. 


7. Stationary vector processes 


In this section we shall specialize the parameter set J of the stochastic 
process a(.). Each point in J is assumed to be of the form 


t=(t, u), T=(R'x JV) (34) 


with the coordinate ¢ taking values on the real axis and representing the time 
and the coordinate u being an element of the m-dimensional complex vector 
space V introduced in I, section 3. It is assumed that a(f, «) depends linearly 


on u, that is 
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a(t, Uy + Ug) =a (t, uy) + a(t, Uy), w(t, Au) =Ax(E, u). (35) 


It follows from (35) that the covariance function defines a linear operator 
Rit, t) on V 


<x (ty, Uz) |x (ty, Wy)> = (Uy (pe: (ee) | TA (36) 
We shall call R(t,, t,) the covariance operator. It has the property 
R (t,, t2)* =R (te; ty) (37) 


and for arbitrary ¢,, ..., tz, Uy -.., Ux 


corresponding to (7). 
The relation (35) suggests the following notation 


x(t, w) = (x(t) | wu), x(t, u)=(u | x(t), (39) 
with w(t) a random vector taking values in V and 
R(t, t) = E| x (t,)) (x (t) |. (40) 


The process a(.,.) is assumed to have the following symmetries which we 
express as conditions on the covariance operator 


a. Stationarity: R (t,, t.) = R (t,—t,), 

(41) 
b. Time reversal invariance: 7 h{—t,, —#,)7'*—=Fit., ¢.), 
where 7’ is the time reversal operator introduced in I, equation (3). 


The unitary operators in H(R'x V) corresponding to (41) a. and b. are the 
group of translation operators U (t) 


U (ty) x(t, u)=x(t+t, wu), (42) 
and the time reversal operator 7 
Tat, vw)=2(—t, T*¥ x). (43) 
We shall further assume that 


a. R(t) is continuous at the origin, that is lim R(t) =R(0) 


t>0 d 
b. R(0) is non singular. — 


It follows from (44) a. that U(t) is a (weakly) continuous function of ¢ and 
that the Hilbert space of the process H(R'x V) is separable that is has a de- 
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numerable basis. (44) b. on the other hand is no essential restriction, for 
R(0)u=0 is equivalent to x(0, u)=0 and hence to a(t, w)=U(t)x(0, wu) =0 for 
any value of t, and we may restrict ourselves to vectors uw from the subspace 
of V on which R(0) is non singular. 

If the stochastic process a(.,.) represents the values of an observable as- 
sociated with a closed system described by classical mechanics or by quantum 
mechanics it is possible to derive the condition (41) b. from the mechanical 
laws. The translation operator U(t) is in this case directly connected with the 
microscopic dynamics of the system and the time reversal invariance of the 
basic equations of motion is represented by the commutation relation 


TU (t)T=U(—?) (45) 


and the unitarity of 7. 


8. Deterministic and nondeterministic processes. Dissipation 


We introduce the simplified notation H,=H ((— 00, t])x V). The subspace A, 
consists of the full past of the stochastic process av(.,.) at the time ¢ and 
evidently H;., contains H; for any positive h. It follows that the projection 
I], onto H,; is monotonically non decreasing as function of ¢. Hence the limits 


lim [],=]]|_.~ and*tm |ly=[],..2 (46) 
to>+a 


t>-~ 


exist as welldefined projections on the Hilbert space of the process. It follows 
directly that [].., is the projection onto the Hilbert space of the process, for 
any element of that space can be approximated by an element of the form 
x[{q] (5) and for any such element I[..2[gy]=2[g]. We observe that the limits 
(46) exist whether the process is stationary or not. 

In general []_..<]].~. If []-~=I[].« we shall say that the stochastic pro- 
cess a(.,.) is deterministic. The meaning of this condition is obviously that the 
future of the process at an arbitrary time ¢, is determined by the full past of 
the process at an arbitrary time fy. 

Now we take into explicit account the fact that the process is stationary. 
Applying the equality (33) to [], and U(t) we find 


U (t,) Th, U (t,)* =] Tess, (47) 
and combining (46) and (47) 
Cine d Os Hence) les O O)*= ll hes: (48) 


The relations (48) express the fact that the subspaces corresponding to the 
projections [[_,, and |]... are invariant under U (t). It is easily seen that the 


1 The limits exist in the sense lim || (II, ~ IL +00) # ||=0 for any x€H (R'x YV). 
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stochastic process is deterministic if and only if for two arbitrary fixed times 
t, and t,, t,<#, we have 


Tl, =Ile, or equivalently H;,= A;:,. (49) 


The “only if” proposition is trivially true. Conversely it is seen that (49) and 
the monotony of []; imply 


Tl, =[ljsn=T7 (2) Ti, 0 (h)* = (O<h<t,—t,) (50) 
and hence [],, = U(h)[]:,U (h)* = U (2h) he, U(2h)* =--- =U (— h) Th, U(—h)* = 
=U(-—2h)|]:,U(—2h)*=-:-. An arbitrary time ¢ can be written in the form 


t=t,+nm-h, with nm, integer and 0<fh;<t,—t#, and I], is found to be indepen- 
dent of ¢, so that [[,=]]_.=[].. and the process is deterministic. 

If [[-..=0 the stochastic process is said to be non deterministic. It is evident 
that in general the stochastic process 2(.,.) may be considered as a linear 
superposition of a deterministic part |] ..27(.,.) and a non deterministic part 


(lice — es) ele, 2) Writing 
w(t, u)=[L. a(t, ) and 2, (t, u)=a«(t, u)—2), w) (51) 


for the deterministic and the non deterministic part respectively, we can write 
the covariance operator R(t) as a sum of two Covariance operators 


R(t) = Ro (t) + Ry) (52) 
corresponding to the decomposition into two uncorrelated processes (51) 


(wu | Ry (t) |v) = <x_(t', u) | tp (t' +t, v)> 


(53) 

(w| R, (¢)|v) = <a, (t’, u)| a, (t' +4, v)d. 
Evidently the stochastic process a(.,.) is deterministic if and only if R,(0)=0 
and non deterministic if and only if R)(0)=0. In general we have 


Rank {A (0) + R, (0)} =m, 0< Rank {Ry (0)} <m, 0< Rank {R,(0)}<m. (54) 


It is seen that deterministic and non deterministic processes can be studied 
separately. 

If the stochastic process a(.,.) has no deterministic part and w is an arbi- 
trary nonzero vector in V, it follows immediately that the (one-dimensional) 
stochastic process x(., wv) is non deterministic, for evidently the corresponding 
subspace H ((— oo, #]x{w}) is contained in H ((— 0, t]x V) so that from (46): 


lim H((— co, t]x{u}) = {0}, (55) 


t+>— 0 


that is the subspace consisting only of the null-vector, whereas as a consequence 
of the assumption w+ 0 
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lim H ((— oe, t]x {u}) (56) 


t++00 


contains at least one nonzero element. Hence it follows in the general case 
that if we V is such that R,(0)w+0, then the (one dimensional) stochastic pro- 
cess x,(., uw) is non deterministic. 

We shall study the non deterministic part of the stochastic process 27 (.,..); 
assuming [|[_..+[]... It is seen that F;, defined by 


eee ee (57) 


is a spectral distribution! on the U (t)-invariant subspace corresponding to, the 
projection []...—[]_., that is H...QH_.., the Hilbert space of the non deter- 
ministic component. Hence we may define the group of unitary transformations 
V(A) on H,..CH_o by 

V(A)=fexp {isd F, (—2 <A< +00). (58) 


From (47), (48) and (51) we readily obtain the commutation relation 
U (t) V (A) =exp{—tAt} V(A) U (t). (59) 


Here U(t) is a continuous function of t¢ according to our assumption (44) a. 
and V (A) is a continuous function of 4 according to its definition (58). 

We observe that (59) is an expression of the commutation relation for ca- 
nonical coordinates and momenta in quantum mechanics. For writing heuristi- 
cally 


U (t)=exp {tg} =1+ttq+h (ita) +... 


(60) 
V (A) =exp {iAps=1+tApt+h(tap)yt--: 
we obtain by formal calculations from (59) 
qp—pq=1l, (61) 


that is the Heisenberg form of the commutation relation. In this connection 
the equation (59) is sometimes called Weyl’s relation [Weyl, 1927, and 1931 
§ 14, 15]. It is known that if q is represented by the multiplicative operator 
in the space L?(— co, + 0°; dq) the representation of p is determined to be 


a d F (q) tat Me: =| es 1 
=-71% = —iF —1—| exp 1 F' (q)}, 62 
(Paap pigs so cream a aah (9) agen (q)} (62) 


where F(q) is an “arbitrary” real function of g. That means that p is deter- 
mined by g and (61) up to a unitary transformation commuting with q. A 
simple proof can be found in [Dirac, 1947] and a rigorous version in [von 


1 This concept is defined in Appendix 2. 
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Neumann, 1931]. For our purposes a somewhat more general representation 
theorem for the groups connected by (59) is needed. It is not convenient to 
work with the infinitesimal generators g and p which are unbounded operators 
in the Hilbert space of the stochastic process and hence cannot be defined for 
every element of that space. With the aid of this representation theorem, which 
is proved in appendix 2, it is possible to obtain an isometric connection be- 
tween the Hilbert space of the stochastic process and the Hilbert space of m- 
dimensional vector functions on the real axis L%, to be defined below, such 
that roughly speaking q is represented by the infinitesimal translation on the 
time axis id/dt and p is represented by the multiplicative operator, ¢. Be- 
fore discussing this representation we shall be concerned with certain dissipative 
properties of non deterministic stationary stochastic processes. 

The subspace H({t}xV) corresponding to one single instant ¢ is unitarily 
equivalent to the subspace H({0}=V). We have assumed in section 7 that the 
operator R(O) in V is non singular, and as was pointed out there is no essential 
restriction in this assumption. It follows that x(0, uw) may be considered as an 
invertible mapping of V on H({0}xV) 


a(0, uy=Xu; uw=X *x(0, u). (63) 

The adjoint of X, X*, is a mapping of H({0}xV) on V 
<Xu|x(0, v)) =(u| X*x (0, v))=(u| X*X]v) or X*X = R(0). (64) 
Let us now consider as in section 4 the (linear) estimate of x(t, w) when 


the stochastic process has been observed up to the time 0. The best linear 
estimate is determined by the projection 


[])u(t, «)=[], U @) x (0, w)=U ()[ 1-2 (0, w), (65) 
and the estimation error x(t, v)—[][,7(é, vw) has the variance 


"—|[TI,x(t, «) |/?= 
=||2(0, w) 


lla(t, x) —[] x (é, u) ||? =|] a(é, 2) 


*—||T]-2”(0, w)||?. (66) 


It follows from the first of the relations (46) that the variance of the estima- 
tion error is a monotonic function of ¢, equal to zero for t=0 and tending to 
its maximum value ||x(0, «)||?—||T]_.2(0, w)||? when t>oo. If a(.,.) is non 
deterministic this maximum value is equal to ||«(0, u)|| otherwise it is smaller 
and in particular for a deterministic process it is equal to zero. It is suggestive 
to compare the monotonic increase of the estimation error variance with the 
monotonic increase of thermodynamic potentials and this point will be discussed 
further below. 

From the last expression of the estimation error variance in (66) it can easily 


be seen that this quantity may be associated with an operator &(t) in the 
m-dimensional vector space V 


Pd 


(u | E (t) | v) = <x (0, u) | x(0, v)> — <x (0, x) | ce | x(0, v)> (67) 


ARKIV FOR FysIKk. Bd 17 nr 16 
or €(t)=R(0)—X*]|]_;X, where X is the mapping defined by (64). The ope- 
rator E(t) is non negative, (w|€(t)|w)>0 for arbitrary w in V, and increases 
monotonically from €(0)=0 to a maximal value & (co): 


ceo = Fe(Q) ax Pa (68) 


It may be noted that the derivative of €(t) exists and is non negative. This 
is a consequence of the fact that, as proved in appendix 2, the “spectral 
distribution” []; is absolutely continuous. The stochastic process 7(.,.) has a 
non deterministic component if and only if for some f, e.g. f=0, we have 


d 
oF E(t) +0. (69) 


The derivative is necessarily non negative. 

It must be noticed that the estimation discussed above is built on the ob- 
servation of the full past of the stochastic process. If the full past of the pro- 
cess is not known the estimation has to be modified in an obvious way. In 
this case we can in general not expect a monotonic behaviour of the estima- 
tion error variance. We consider the extreme case of estimating a(t, w) from a 
single observation which gives us «(0, .). The estimate x(t, uw, {0}) is determined 
by the projection of x(t, u) into H({0}*V), that is by the scalar products 


<x (0, v) | x(t, u, {0})> = <x (0, v) | U (t) | “a {(O), Hh)» 


(70) 
(v| R(0)|X~* x(t, w, {0}))=(v] R(t) | w), 
and hence 
X~ x(t, wv) {0})=R(0)* R(t) wu or Gay 
a(t, u, {0})=2(0, R(0)* R(t) x). 
The estimation error variance is given by 
(w| R(0) — R(t)* R(O) * R(t) |u). (72) 
It follows that the operator 
R(0)—R()* RO) R(t) >0 (73) 


is equal to zero for t=0, but in general its dependence on ¢ is not monotonic. 
If the m-dimensional Gauszian stochastic process corresponding to the covariance 
operator R(t) (see below, section 11) is mixing, that is if 


lim & (t)=0 (74) 
we find that the estimation error variance approaches its maximal value R (0) 
when ¢ tends to infinity. 
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9. Causal functions and transforms 


We have seen in the last section that a non deterministic stochastic process 
is always associated with the two groups of unitary transformations U(.) and 
V(.) which are related by the commutation relation (59). As we shall see in 
the next section the existence of these two groups is sufficient to establish a 
very simple representation of the stochastic process. The framework for this 
representation is the Hilbert space of m-dimensional vector functions on the 
real axis, L*,, and this section is devoted to a study of this space. 

Let V be the m-dimensional complex vector space introduced in section 7. 
By Ly we denote the class of all functions on the real axis with values in J, 
such that 


+00 


{ (v,| vp): d.t< = co. (75) 


— 0 


It is easily shown that L*, is a Hilbert space with the scalar product 


+ 00 
OO = ) (v, | u,)dt (76) 
and the correspoding norm ||v.||?=<v.|v.>. 


On Li, we introduce the group of translations U, (.) 

[U.Qv.ly=u+4 (v.€Ly, —o <t< +00), (77) 

and the multiplicative group V,(.) 
[V.(t)v.]p=exp {ttt'} vp. (78) 
It follows at once from the definitions of U,(.), Vz(.) and the scalar product 
(76) that the transformations (77) and (78) are unitary and that they are con- 

nected by the commutation relation 
Uz (t) Vz(t') U.()7* V(t’) *=exp {-tt#'}, (79) 


which is just the Weyl relation (59). 
We shall further introduce the transformation J, inversion at the origin, 


[J v.],=v_y. (80) 
The inversion transformation is obviously a unitary involution 
Jom id? ad | id (81) 
and connected with the group of translations by the ‘time reversal relation” 
JU,(t)=U.(—-#) J, (82) 
J V.z(t)=V.z(—t) J. 
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Complex conjugation may be represented as an anti linear, isometric trans- 
. . 9) 
formation C on Ly 
[C v. le = C! Ut (84) 
where Cv, denotes the complex conjugation operation in V, as defined in Nee 


section 3. 
From the definitions U,(.) and V.(.) it follows at once that 


CU, (t) =U, (t) C, (85) 


CVO=V.(-HC. 


The Fourier-Plancherel transformation S in L*, is defined by 
+00 


exp {—itt} dt! vy, (86) 


[S Uv. lt — = 


where the integral is defined as a limit in the mean or as 


|e! Pex {f—itt’}—] 
= | = G# 2 dt! vy. 
—4 


(87) 


The Fourier-Plancherel transformation S is unitary (Parseval’s theorem) and 
satisfies 


Fa), sta 1, (88) 
and hence si J sh =e 8 (89) 
Further it is obvious that OS= pt C, (90) 


By direct application of the definitions it is found that U(.) and V(.) are 
connected by the Fourier-Plancerel transformation 


U, (t) S=SV, (b), (91) 
or SU,08=VAt); 8*V.(()S=JU,6) J=U,(—2). (92) 
A family of projection operators {F'.,,| —°o <t< +o} is defined by 


Vt i <t 


aS 93 
[Fat?e= yoy (93) 


Evidently F.,., when properly normalized, is a spectral distribution corresponding 
to the spectral resolution of the group V,(.) 


+ 00 
Vi(t)= [ exp {iti} dy Fav. (94) 


-o 
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. 9 . . 
A vectorfunction v.€ Li is called causal if 


Fo Ce = (0): (95) 


. 9 . . . . 
and a vectorfunction u.€ L4, is called a causal transform if its Fourier transform 
is a causal function, that is if 


FioSu. =0. (96) 


The concepts causal function and transform have arisen in connection with the 
study of so called stationary causal filters, which are represented by linear, 
translation invariant transformations which have the property of transforming 
causal functions into causal functions, and are characterized by “‘dispersion rela- 
tions’; see e.g. [Toll, 1956}. 

The projection onto the subspace of causal transforms is according to (96) 


P=" F.0S. (97) 
It follows from (90), (89), (92) that P is connected with the conjugation through 
CR=4— f£) 0: (98) 


We introduce the additive but non homogeneous operations “‘real part’? and 
“imaginary part” 


R=4(1+0C), (99) 
1 
de len (100) 


and the real transformation } in L*, (the Hilbert transformation) 


H=2IPR=1(1—2P)R, (101) 
RH=HR=H. (102) 


It follows at once from the second expression (101) that 
HU=—R (103) 
and that ||F#v.|]=||Rv.|| (v.€ £4), (104) 


Hence 7 is an isometric transformation of the real Hilbert space RL‘, onto it- 
self, and its inverse is —}. It follows further that with v. arbitrary in L, or 
Rv. arbitrary in RL’, 


Rv. +iH Rv. =2P Rr. (105) 


is always a causal transform. On the other hand we have 
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RTD Pel ed Mn or for Pv.=v. HRv.=IJv. (106) 
HIP=—_RP, ortor Pov. PIV == Ro. (107) 


showing that a causal transform is always of the form (105). We have obtained 
by algebraic methods the wellknown criterion [Titchmarsh, 1948]: a function is 
a causal transform if and only if its real and imaginary parts are connected 
by a Hilbert transformation. Usually the criterion is derived from the explicit 
representation of the Hilbert transform 


[HRv.j= = Prine. value | Rods, (108) 
which is easily obtained from (101). 

There is a further wellknown criterion: a function is a causal transform if and 
and only if it can be extended to a function which is analytic in the upper 
half of the complex plane (for the precise formulation see e.g. [Titchmarsh, 
1948]). 

So far the fact that we are dealing with vector valued functions has played 
no essential role as all statements can be referred to complex functions of the 
form (u|v.) with w€V and v.€ Ly. 

We shall now consider a family of linear transformations f(A) on the m-di- 
minsional vector space V, defined for every real A, such that for every vector 
v€V the vector function Jf(A)v is a causal transform. This implies in particular 
that for every v, f(.)v is an element of Ly, and hence the integral 


+00 
[ FA F(A) da (109) 


exists and defines a non negative operator in V. Further the Fourier-Plancherel 
transform of f(.)v 


Sf(.)v=g(.)v (110) 


defines a family of linear transformations g(.) on V with the property 


(111) 
or g(t)=0 (t>0). 


From the unitarity of the Fourier-Plancherel transformation S and the relation 
(92) it follows in view of (111) that 


+00 min (0, ¢) 
f exp {i2¢} f(a)* f(A) dA= fg)" o(t -far’. (112) 


It is not difficult to see that one can always find families of transformations 
f(.) with the desired properties. Excluding the trivial case {(.)=0, we can al- 
ways find a finite real number f,=0 such that f, is the smallest number for which 
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0 to 


f ott ge )at'= f[ gt) ot) ar. (113) 


=06 -@* 


It is no restriction to assume t,=0, for if this is not orginally the case we can 
replace f(A) by exp {ito} f(A). 
We consider now the subspace of Li, which is generated by all elements of 
the form 
SS elt, wUz) g(.)u (114) 


t< we V 


(=) 


where the complex function c(t, wv) is equal to zero for all but a finite number 
of values of its arguments. Evidently this subspace which we shall denote by 
L(0) is contained in the subspace F’,,o L*, formed by all vector functions in Ls 
which vanish on the positive part of the real axis. In general L(0) will be a 
proper subspace of Ly. The family of subspaces L(t) defined by 


L(t)=U(t) L(0), (115) 


is evidently increasing with ¢t: L(t+h) contains L(t) if h>0. Hence the limits 
L(— co) and L(+.) exist, and evidently 


L(—cc)={0} (the subspace consisting only of the 0-vector) 
LG oie Le 


It follows from the form of the elements (114) that L(+ co) consists exactly 
of all vector functions of the form 


+ 00 


f(A)or, Jf (eal f(A) #(A)| va) da < +00. (116) 


— 0 


Hence we find that L(+ 0cc)=L*, if and only if f(A) has an inverse {(A)~! for 
every value A. In this case only every element of Li, can be represented in 
the form (116). 

It follows further that L(+ cc)=L¥ if and only if L(0)=F.,0 Li. Evidently 
the condition is sufficient for L(0)=F.2,o Ly implies that L(t) =F.,, L*, for every 
value t. To see that the condition is also necessary we assume that L(+ co) =L*,. 
As we have seen this is equivalent to the existence of f(A)’ for every A. The 
elements of L(0) are the Fourier-Plancherel transforms of elements of the form 
f(A)v, such that Jf(.)v. is a causal transform. On the other hand as f(A)~? 
exists every causal transform can be represented in this form, and the proposi- 
tion is proved. 

We shall conclude this section with a discussion of the following problem. 
Given a family of non singular positive transformations o’ (A) on V defined for 
every real 4. It is assumed that o’(.) is integrable over the real axis. Under 
which conditions is it possible to represent o’(.) in the form 


a’ (A) = f(A)" f(A) (117) 
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such that f(A)v is a causal transform for every v€V. Evidently a necessary 
and sufficient condition is that the Fourier-Plancherel transform of o’ (A) can 
be represented in the form (112) 


+00 min (0, f) 


J o'(A)exp {iAhdA= | git’)*gtt’-Hae’. (118) 


—-o 


It is known that in the one-dimensional case (dim V=m=1) a necessary and 
sufficient condition for the possibility of a factoring (117) is that 


log o’ (A) 
——-———d},j>—0c0 119 
area os 
and that the factor f(A) is given by 
+00 a1 
, 1 1+ tA log o’ (t 
Se be ee 2 
(A) exp | al fA 14 at} rote 


The factoring problem is identical with the problem of determining the scat- 
tering phase shift from the scattering amplitude [Toll, 1956]. In [Karhunen, 1950] 
the general solution of the factoring problem is derived. 

One can expect that in the multipdimensional case (119) and (120) hold as 
operator relations in V. However due to computational difficulties such rela- 
tions are almost useless. On the other hand it is easy to derive some condi- 
tions which are only necessary. E.g.: the factoring problem (117) has a solution 
only if for each non zero element v in V 


== 


{ log (v|o"(A)|%) 7, (121) 


1+/? 


or, the factoring problem has a solution only if 


ie log {Trace o’ (A)} 


eae Tite oot (122) 


—o 


10. Representation of a non deterministic process 


We shall now return to the stochastic process x(.,.) introduced in section 7. 
According to Stone’s theorem on the representation of one-parametric groups 
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of unitary transformations, the translation operator U (t) has the spectral resolu- 


tion (see appendix 2) 


U(t)= { exp {itaA}dB, (123) 


corresponding to the Cramér-representation of the stochastic process 
x(t, v)= | exp {itd} d,z(A, v), (124) 
where z(.,.) is a stochastic process with uncorrelated increments 


z(A, v) =(£,— Ey) x (0, v), (125) 


and corresponding to the Cramér-Khinchin representation of the covariance 
operator R(t) (section 7) 


+ co 
R(t)= f exp {ita} do(A) (126) 


where the spectral distribution operator on V,o(.), is determined by 


(w | a (A) | v) = <x (0, u) | E,| alOxv) ys (127) 
It follows that 
dy, 2 (Ay; v)| da, 2 (Ap, U)> = 6a, a, da, (v| o (A,) | u) (128) 


where 06,,,,, is Kronecker’s symbol. 

Applying the decomposition theorem proved in appendix 1 to the spectral 
distribution #. we obtain a decomposition of the stochastic process x(.,.) into 
three mutually uncorrelated processes having respectively an absolutely continu- 
ous, a singular discrete (jump), and a singular continuous spectral distribution 
operator on V 


o (A) = 07 (A) +04 (A) + 6 (A) 


A 


o (A= J o' (udu, (ulo2(A)|v) =<x(0, w)| H3|x(0, v)> 


(129) 
a (A)= > a (A, +0)—o(A,—0), (w|o% (A) |v) = <x (0, w)| B37] x (0, v)> 


i 


a (AK (w| o° (A) | v) = (x (0, u) | EX° | x (0, v)>. 


It is easily seen that this decomposition of the stochastic process is invariant 
under the time reversal transformation J’ introduced in section 7 eq. (43) so 
that each of the three components of the process satisfies the symmetry rela- 
tion (41 b), (see also [Uhlhorn. 1960 b]). The decomposition of the stochastic pro- 
cess into a deterministic and a non deterministic part was described in section 8. 
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It was shown in this section that the non-deterministic part of the stochastic 
process is characterized by the Weyl relation (59). On the other hand it is dem- 
onstrated in appendix 2 section 4. that the Weyl relation implies that the 
spectral distribution #. is absolutely continuous. Hence the singular parts of the 
spectral distribution operator (129) correspond to deterministic stochastic processes 
whereas the absolutely continuous part o*(.) may in general determine both a 
deterministic and a non deterministic stochastic process. In accordance with 
equations (52), (53) 


go (A) = 09 (A) +0, (A); 6% (A) = 0, (A) + 06 (A), (130) 


Hit, U)=mara(t, Vv) tq (t, 0) 4-2, (t, ©) + 2, (t, v). (131) 


We are now going to study the non deterministic component x, (.,.) of the 
stochastic process, assuming rank R,(0)+0. In view of the orthogonal decom- 
position (131) we may avoid clumsy notations by assuming for the moment 
that the stochastic process consists only of a non deterministic component 
7) [or a Cae bP 

The Hilbert subspace H,= H ((— cc, t]x V) of the process (section 8) is gener- 
ted by all finite sums of the type 


Se eh ale al > elbow) OE) 2 (0408) = 
t’<t, we V t’<t, ue V Pe (132) 
= ee CU) f exp {it A} d,z(A, u). 
Pt, wey, =) 


As o(.) is absolutely continuous and non negative we can always write 


A 


o(A)= [ flu)* tu)dp. (133) 


—o 


Combining (132) and (133) we find that every element in H,, is uniquely mapped 
onto an element of the Hilbert space of m-dimensional vectorfunctions Ly, in- 
troduced in section 9. The mapping is linear and isometric and is determined 


by the correspondence 


2 (0, v)>f(A)v (134) 


U (t) x (0, v) > exp {ita} f(A)». 
As is seen from the discussion of section 9 this mapping, determined by (134), 


is invertible exactly if the operator f{(A) has an inverse for every A. This condi- 
tion is indeed satisfied for if v is a nonzero element in V we have for every A 


(vja(At+h)—o(A)|v)>0 (h>0), (135) 


and hence the operator o(A+h)—o(A) has rank m if h+0. The inequality (135) 
is a consequence of the fact that for fixed v€V, x(.,v) is a one-dimensional 
non deterministic stochastic process (section 8) and the fact that the spectral 
distribution corresponding to a non deterministic process can have no gaps (ap- 
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pendix 2, section 5). From the connection between o(.) and f(.) (133) it follows 
that f(A)! exists for almost every 2. We shall denote the unique inverse of 
the mapping (134) by Y,. That is Y, is a linear isometric transformation of 
the Hilbert space Li, onto the Hilbert space H.. of the stochastic process: 
Y,f(.)v=x(0, v0 
¥z1U() Y,=V2(t) (eq. (78). 


Hence the L*,-representative of U (t) is the multiplicative transformation V, (1). 


We want to determine the representative of V (t) (58), Y;' V(t) Y,. It follows 
from the representation theorem for the Weyl relation proved in appendix 2 
that Y;'V(t) Y, is equivalent to the translation operator U,(—t) (77) up to a 
unitary transformation W which commutes with VY; LUNE) Y,, that is 
Ux=h=W Ya Vie) rey 
(137) 
Vioj=W YY, U®Y,W=yr7 UOT, 

We define the isometric, linear transformations Y and Z of Li, onto H,, by 
Y=Y,W, (138) 
Z=YS8S* (S defined by (86)). (139) 

Combining (137), (138), (139), and (91) we find 

U(t)Z=ZU,(t) (eq. (77)), (140) 
V()Z=ZV.(t) (eq. (78)). (141) 


From the definition of V(t) (58) and of V,(t) (78), (94) it follows the corre- 
spondence between the spectral distributions F’. and F,,. 


Zh = Pe. (142) 


The transformation Y~'=(Y,W)'=W* Y;* can be analytically characterized 
by a correspondence of the form (134). This follows from the fact that (see ap- 
pendix 2) the general form of a unitary operator in LY, commuting with V,(.) is 


[Wv.],=w(d)v (143) 
where w(A) is a unitary operator in the m-dimensional space V defined for every 
real A. Hence, replacing in (134) f(A) by w(A)* f(A), we obtain the representation 
of Y"". In the following we shall denote the representative of Y~! by f(A): 

YraOpy)F (ie: (144) 
Further we introduce in correspondence to eq. (110) 
Z*2(0,v)=S}f(.)v=g(.)o. (145) 
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We have obtained the following representations of the non deterministic 
stochastic process x(.,.) 


a(t, v) = YV,(t) f(.)v=ZU, (t)g(.)v. (146) 


Here Jf(.)v is a causal transform for every v€V. This follows from (142) and 
the definition of F;. As 


F209(.)v=F20Z2°' «(0,v)=Z* Fy x(0, v0) =Z-' x (0, v) =9(.) 2, (147) 
we have FioSJIf(.)v=F.i0J9(.)v=0, 


and the statement is proved. 
Corresponding to (145) we obtain a representation of the covariance operator 


R(t) (36), (118) 
es min (0, t) 


R(t)= [ f(A)" f(A) exp {iat} da= He Galois — Wiad te (148) 


We summarize the properties of the transformation Z from L*, onto H.,. 


a. Z is an invertible, isometric linear transformation. 
b. U(t)Z=ZU,(t). (149) 
ce. V(t)Z=ZV.,(t) or equivalently fF, Z=ZF,,o. 


The transformation Z is page determined by g(.) (145) or equivalently by 
f(.). Conversely g(.) and f(.) are uniquely determined by the properties (149) 
up to a unitary ie aH in the m-dimensional space V. To see this, as: 
sume that Z, is a transformation satisfying (149). It follows directly that Z:*Z 
is a unitary transformation on L*, which commutes with U(.) and with V(.). 
Hence Z;'Z is determined by a unitary operator in V, say w and 


wg(.)v=Z,'Zg(.)v=Z' x(0, v) =9,(.) 2 (150) 


for every vEV. 
The transformation Z% may be represented as a stochastic process with un- 


correlated increments, that is a “random measure’. Let gy (1a, .) be the indicator 


function 
(a<t<b) 


1 
151 
0 (t <a, t>b). uel) 


(Ia, t)= 
For every v€V we have g (Ii, .)v€L,. Hence we may define Zi lene) DY: 
Zilyay=Z olan 2) vs (152) 


It follows at once from (149) that 
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Z (1h, Aut pr)=AZ (en u)t+uZ (La, v) 
(Z (£2, u)|Z (12, v)> =|b—a| (ul) 


LIE) VAUD ae (OS) 153) 


—_ 


U(t)Z (Ia, u)=Z (lati, u) 
<a(t, u)|Z(1%, v)>=0 (t<a), 


and that the representation (146) may be written in the symbolic form of a 
“causal’’ stochastic integral 
t 
x(t, v)= | Z(dt', g(t'—t)). (154) 


—-o 


In the one-dimensional case the representation (154) was obtained already in 
1950 [Hanner, 1950], [Karhunen, 1950]. The methods of these papers are how- 
ever not easily extended to the multidimensional case discussed here. 


11. Application of the linear theory of stochastic processes to statistical 
mechanics 


Before taking up the discussion of problems connected with the probabilistic 
interpretation of the linear theory of irreversible processes, outlined in I we 
shall make some general comments on the application of the linear theory of 
stochastic processes to the statistical description of mechanical systems of macro- 
scopic dimensions. The stochastic processes which are used for the temporal 
description of macroscopic phenomena may always be associated with stationary 
processes. This possibility has been discussed in [Uhlhorn, 1960, b], and will be 
accepted in the present discussion. The stationary processes in question corre- 
spond to closed mechanical systems in macroscopic equilibrium. The stationary 
probability distributions which describe the fluctuations of the macroscopic ob- 
servables in the equilibrium state are usually assumed to be sufficiently char- 
acterized by their first and second order moments: the mean values and the 
covariances of the observables. The justification of this assumption rests more 
or less directly on the law of large numbers, in particular on the central limit 
theorem. Formally the assumption is expressed by the statement that the prob- 
ablility distribution in question are approximately Gauszian. By the same argu- 
ment one can claim that the probability distributions of values observed at 
different times are approximately Gauszian, though the approximation may be 
worse than for the stationary distribution. In principle deviations from the 
Gauszian form can be obtained in a systematic way by the use of a multi- 
dimensional Cramér-expansion in terms of a parameter proportional to the size of 
the mechanical system [Uhlhorn, 1960 a]. One may even approximately replace 
the stationary stochastic process which describes the kinetics of fluctuations by 
a Gauszian process having the same first and second order moments. 

Let us assume that the stationary stochastic process 2(.,.) introduced in 
section 7. corresponds to the measurement of an m-tuple of macroscopic ob- 
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servables. We assume that the mean values are normalized to zero and that 
the observables are real 
E x(t, u)=0. 


The probability distribution of any finite set X(t, Uz), ..., L(tp, Uy) can be obtained 
from the characteristic function, which in the Gauszian approximation is equal to 


(meee | ] bia 2 
i > G20, us) =exp \- 3 D> D> kik; (us| R(t —t)| w) (Eien pemeall) 


E exp > 
j=1 

(155) 
The family of all finite dimensional characteristic functions characterize the sto- 
chastic process. In the Gauszian approximation the covariance operator R(t) is 
sufficient to characterize the process. We recall that Gauszian random variables 
are stochastically mdependent if and only if they are uncorrelated, corresponding 
to a factoring of the characteristic function. 

From the form (155) of the characteristic function it is also easy to obtain 
the theorem, proved by Grenander in the one-dimensional case [Grenander, 
1950]: A stationary, Gauszian stochastic process whose covariance operator R (t) 
has the spectral representation 

R(t)= { exp {idt}do(A) (156) 


—-30 


is ergodic exactly if the spectral distribution operator o(A) is a continuous 
function of A. 

It is obvious from the Cramér-representation (124) of the Gauszian process 
that the condition is necessary. For if ao(A) has a jump at A=A, there exists a 
%€V such that z(A+0, v9) —z(A—0, v9) =z, is a non-vanishing complex Gaus- 
zian random variable with mean zero, on the probability space of the stochastic 
process, satisfying 


U (t) a, =exp (ttg) z,» 


i.e. U(t) has an eigenvector. The stochastic variable |z,,| is obviously U (t)-in- 
variant but not equal to a constant. Hence the stochastic process is not er- 
godic if o(A) is not continuous. That the condition is also sufficient is most 
easily proved by the same method as in [Uhlhorn, 1960 b] for the case of dis- 
crete time. 
Usually we have no physical reasons to assume that o(A) has a singular 
continuous component (though such distributions can easily be constructed). 
Hence if the Gauszian process is ergodic its covariance operator will usually 
have the property 
+o 
lim R (t)= lim j exp {itd} o’ (A)dA=0. (157) 
eo 


|t|l>oo || 00 _ 


On the other hand (157) is sufficient to assure that the stochastic process is 
ergodic, and it in fact implies strong mixing as defined in [Hopf, 1937] and 
corresponding to complete stirring in Gibbs’ sense [Gibbs, 1902, Ch. 12] [Uhl- 
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horn, 1960 b]. In particular (157) implies that the conditional meanvalue of the 
m-dimensional observable described by 2(.,.) will approach its equilibrium value 
in the course of time: If the value v€V was observed at t=0, the conditional 
meanvalue at an arbitrary time ¢ is according to (71) 


v (t)=R (t)* R(0)* 0; v (0) =», (158) 
and hence lim v(t)=0. (159) 
t>+0 


It should be remarked that the fluctuation around the conditional mean value 
(158) is determined by (72) and is always smaller than in the stationary state. 
Hence the meanvalue is representative for the process. 

If the stationary Gauszian process x(.,.) is. non deterministic its spectral 
distribution operator o (A) is certainly absolutely continuous, and the process is 
strongly mixing. 

We remark that the time average of the stochastic process x(t, w) is equal 
to its phase average (=mean value) 


lim a x(t, u)dt=0 (160) 
too 9 
to 
1 
if and only if lim re | (w| R (t)| vw) dt=0, (161) 
0 
0 


and if and only if the spectral distribution (w|o(A)|w) has no jump at 2=0 
[o(+0)—o(—0)]u=0. (162) 


In the terminology of [Uhlhorn, 1960 b] x(0,w) is an ergodic element of the 
Hilbert space of the stochastic process. 

The conditions (160), (161), (162) are obviously much weaker than ergodicity, 
but may be sufficient for certain physical applications. 


12. Considerations on time smoothing and smoothing of the spectrum 


The values of macroscopic observables occurring in the statistical mechanics 
of non equilibrium phenomena are usually interpreted as averages over an inter- 
val of time necessary for the measurement. The physical reason for this averag- 
ing or time smoothing may be formulated as follows. The observable A is re- 
presented by a definite phase function or operator in the state-vector Space of 
the mechanical system. When it is stated that the value of the observable A 
at the time ¢ is A’ this means that a measurement has been made in a time 
interval covering t, and we do not know to which time ¢’ in this interval the 
value A’ really belongs. 

Mathematically this may be formulated in terms of a stochastic observation 
time with a probability distribution extending over the indicated time interval. 
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More precisely: the observation time is considered as the representative of a 
stochastic process t (t), with stationary increments and mean value t. We shall 
now discuss the effect of this interpretation of the observation time on the de- 
scription of series of measurements. 

The representative of the observable A and the equations of motion of the 
mechanical system generate a Hilbert space in the usual way. On this Hilbert 
space the kinetics is determined by the unitary translation operator 


+ 00 


U(t)= | exp (GAtd&,. (163) 


— 0 


According to our interpretation of the observation time we have to replace 
U(t) by the operator U(t(t)) which is random through its dependence on t (t). 
We shall be interested in the mean value of U(r(t)). We write 


z (t)=t+6 (t) (164) 
and denote the characteristic function of 6(t) by 
g(A)=E exp {iA 6(t)} (A real). (165) 


As @(t) is stationary (A) is independent of t. Hence we obtain formally the 
mean value of U (r(t)) 


+o + 09 +} 00 

E [ exp {i(t+6(t))A}dH,= [{ (A) exp {itd}dH,= [ p(a)dB,U(t). (166) 
aS 

Hence we have obtained a linear time smoothing operator i gy (A) d BE, which com- 


mutes with U(.). If the probability distribution of 6 (t) is derived from a density 


we have 
lim (a)=0 (167) 


|A|>+o 


and the effect of the time smoothing is seen to be a damping of the higher 
frequencies in the spectrum of U(.). If the probability distribution of 6 (#) is 
symmetric we have w(A)=gy(—A) and the time smoothing operator commutes 
with the time reversal operator 7, that is time smoothing does not destroy 
time-reversal symmetry. 

It has sometimes been suggested that time smoothing is responsible for the 
irreversible appearance of macroscopic phenomena.! Obviously this is not the case. 
Irreversibility and the approach to equilibrium are connected with the character 
of the spectral distribution of U(t), in particular with the absence of jumps in 
this distribution and this character is obviously unaffected by time smoothing. 

A smoothing of the spectrum on the other hand will always give rise to ir- 
reversibility. In its most crude form such a smoothing is achieved by the in- 
troduction of a damping factor. Sometimes a smoothing of the spectrum can 
be obtained as an asymptotic result when the size of the mechanical system 


1 Kirkwood, lectures given at Varenna in 1959. 
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increases infinitely. Such results can be obtained e.g. for linear mechanical sys- 
tems with translation symmetry (linear chain and similar systems [Hemmer, 1959]). 
In this connection it must be realized that though the (point) spectrum of U (é) 
will become denser with increasing size of the mechanical system this does not 
at all imply that the spectral distribution will become absolutely continuous. 
On the other hand a number of investigations on the approach to equilibrium 
of mechanical systems (van Hove, Prigogine) are based on the not easily justified 
assumption that the spectral distribution of U (t) is absolutely continuous (or 
rather on assumptions equivalent to this). Hence it is not surprising that these 
investigations can demonstrate the existence of irreversible effects. A more thor- 
ough examination of the asymptotic spectral properties of U(t) for mechanical 
systems of macroscopic dimensions would be of great value. However a very 
large number of degrees of freedoms of a mechanical system is not a necessary 
condition for the existence of an absolutely continuous spectral distribution. In 
fact one can show that the so called ““Normalsysteme’” which are known to 
be ergodic [Poincaré, 1892], [Fermi, 1923] are characterized by an absolutely 
continuous spectral distribution, and are hence strongly mixing in the sense of 
[Hopf, 1937]. 


13. Probabilistic interpretation of the linear theory of irreversible processes 


In order to obtain a probabilistic interpretation of the linear phenomenological 
theory of irreversible processes outlined in I ve associate with the m observables 


ee (168) 


which determine the macroscopic state of a mechanical system with a stationary 
m-dimensional real stochastic process (section 7) 


{x (t, v)|— co <t< +00, vEV}. (169) 
The correspondence between the stochastic process x(.,.) and the phemenolog- 
ical equations is determined by the assumption that the time dependence of 
the conditional mean value (158) is given by the phenomenological equations 


I (11). With the aid of the operator @(t) introduced in I (20) this correspond- 
ence can be expressed in the form 


R(t)* R(0)'=Q(t)=exp {-tM} (tS 0). (170) 


It is however not a priori evident that the relation (170) really defines a 
covariance operator. In order to show that the definition is consistent we must 
demonstrate that R(t) defined by 


=Q(—#)R(0) (t<0) (171) 
satisfies the positivity condition (7). 
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For this purpose we determine the Fourier transform 


a P : Il 
=| R(t) exp {—et} exp {—iAt}dt =— R (0) [M* +e-i7} (172) 
12 
0 


256 2 


am 


which exists for sufficiently large ¢>0. By the inversion theorem 


ee exp {— et} R(t) (¢>0) 
=a R (0) (M*+e-iA}! exp {id= 1R(O) (#=0) (173) 
4 7t 


—2 ) (¢<0). 


That is [M*+e¢-—iA]' is a causal transform. Hence we find using (171) the 
following representation for R (t) 


+0 
> 


R (t) = lim = | {R (0) [(M* + e—ial*+[(M+e+iay' R(0)} exp {iAthda 
el0 +7 
= lim oe | [M+e+id}-? [MR (0) +R (0) M*+2¢R(0)]x (174) 
ef 0 47 


x[M*+e-—tA]* exp fidt}da. 
It follows that F (t) satisfies the positivity condition (7) if and only if 
M R(0)+R (0) M*>0. (175) 


Comparing the condition (175) with the condition for non negative entropy 
production I (26) a. we find that a possible choice for R(Q) is 


R(0)=xs (x>0), (176) 


where s is the second order entropy operator I (20). A statistical mechanical 
consideration shows that, with x= Boltzmann’s constant,! this choice is correct. 
Hence we have found that 


RiGj=s7 Ot)  t20 (177) 
is the covariance operator of a stationary stochastic process #(.,.), and its 
spectral distribution can be determined from (174). 

The stochastic process x(.,.) associated with the covariance operator R (t) (177) 


is variant under time reversal, for the time reversal symmetry of the pheno- 
menological equations (Onsager’s reciprocal relations) is equivalent to (I, eq. (25)) 


Se Od ale LO (178) 


1 In the following we put Boltzmann’s constant #= 1. 
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and hence to Rij? = Tae i ie it): (179) 


This is exactly the condition (41 b). for time reversal invariance. 
The relation (175) can also be obtained as a necessary condition for the non 
negativity of the estimation error variance (73) 


R(0)— R(t)* R(0)7' R(t) = R(0) — Q(t) R(0) Q(t)* > 0 (t=). (180) 


This expression is zero for f=0 and hence its derivative for ¢=0 must be non 
negative. This is exactly the condition (175). It follows from I (47) (d= (0)) 
that the estimation error variance increases monotonically to R(0)—d(co). In 
particular the phenomenological equations are strictly irreversible if and only if 
d(co)=0, that is if and only if the estimation error variance reaches its maximal 
value in the course of time. 

On the other hand the phenomenological equations are strictly irreversible if 
and only if the operator M has no purely imaginary eigenvalues. In this case 
the spectral distribution o(.) of #(t) has a density 


o (A)=f{(4) (A), {C= [ROM + ROM 1A) (181) 


Obviously Jf(.)v is a causal transform (section 9). We introduce g(.) in anal- 
ogy to (110) 


1 , 0 (t>0) 
t) =—— | exp {-itda}f(A)dd= : ‘ 82 
1O~ Fe | oP (HIM uerarropac—o ¢<o, 1) 
and obtain a causal representation of the non deterministic process (154) 
t 
x(t,v)= [ Z(dt, [R(0) M*+ MRO) Q(t-t’)0), (183) 
in vector notation (section 7 eq. (35)) 
t 
x(t)= { Q(t-t’)[R(0)M*+MR(0)} Z (dt). (184) 
The expression (184) is the solution of the generalized Langevin equation 
d, x (t)= —M x (t)dt+[R (0) M*+ M R (0)}! Z (dt), (185) 


that is the phenomenological equation I (11) including a purely random term. 
It should be observed that the Langevin equation is here obtained as a result 
of the assumption (171). In order to obtain the representation (183) we had to 
assume that M had no purely imaginary eigenvalues. As pointed out this is the 
necessary and sufficient condition for the phenomenological equations to deter- 
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mine strictly irreversible processes. Hence we find that the concepts irreversible 
thermodynamic process and non deterministic stochastic process correspond ex- 
actly to each other. For the existence of a purely imaginary eigenvalue of M 
would give rise to a jump in the spectral distribution and the stochastic pro- 
cess would contain a deterministic component. 

We know that in general the stochastic process a(.,.) is an orthogonal super- 
position of a deterministic and a non deterministic component. It is clear that 
for the particular covariance operator (177) this decomposition will correspond to 
the decomposition of the spectral distribution operator into an absolutely con- 
tinuous and a discrete part. In order to study the general case a little closer 
we rewrite the equations I (46), (47) (d= R (0)) 


0 
R(0)= | Q(-t') R(0)Q(-t/)* +d (x) 


: (186) 
= { Q(-#) [MR (0) +R 0) M*]Q(-#')* dt’ +d (0). 
a *>4 = * = 
With gp —- MRO +R OMY Q(-H (<0) 87) 


0 (=0); 


and recalling that 
Q(t) d (co) Q(t)* =d(co) or equivalently Md(oo)+d(cc) M*=0, (188) 


we obtain the following representation of F(t) valid for positive and negative 
values of ¢ 


R(t)= | gC) gC —bHdt'+d(~0)Q" 


(189) 
+o 
= f git’ +t)*g(t’)dt’'+Q(—t)d(c). 
This is the desired decomposition 
+ 00 
Ry (t)=d() Q(t)", RO= f g@)gt—Hat (190) 


for evidently R,(t) has the form (148) and the spectral distribution of Ry (t) is 
discrete as is seen from the Fourier integral representation corresponding to (174) 


2 “an 
Rg (t) = lim 5 f UM +e+idy*2ed(co)[M*+e—-sA] exp {iAty da. (191) 
BO, 2ae o. 


On the other hand (190) corresponds to a decomposition into a reversible and 
an irreversible system (I section 3). 
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We denote the deterministic component of the stochastic process x(.,.) by 
ay(.,-), the non deterministic by 2; (., -). The deterministic component is char- 
aetcrized by its vanishing estimation error variance and hence according to (71) 


ay (t, ¥) =a (0, Q(t)" v) or aq (t)=Q @) % (0). (192) 


The non deterministic component on the other hand has the causal representa- 
tion (183) (184). The composed process has the form 


t 


x (t)=Q(t) x (0)+ f Q(t-t') (RO) M*+MR (OZ (at), 


: (193) 
a(t)=Q(t)e(0)+ f Qt-t)[R(0) M*+MRO)]}EZ At), 
and satisfies the Langevin equation 
4 .@)—-—-Met)+(R 0) +R 2 (194) 


dt 


We have thus seen that the phenomenological linear equations of irreversible 
thermodynamics imply their probabilistic interpretation almost uniquely. It 
should be noted that the Langevin equation (194) is here derived as a resuit 
of the phenomenological theory whereas it is usually taken as the starting point 
for theories of irreversible processes based on generalized Brownian motion Lea 
sager and Machlup, 1953] [Hashitsume, 1952, 1956]. 

‘In the Gauszian approximation the conditional probability distribution of x (é, .), 
given x(0,.), is most easily obtained from its characteristic function and the 
representation (193) of w(t, .). We recall that the random measure Z(.,.) cor- 
responds in the Gauszian approximation to a multidimensional Gauszian process 
with independent increments. The characteristic function is (v real € V) 


E {exp {i x(t, v)}|x (0, .)}J=C, v) 


exp {ix(0, Q(t)* v)} exp {- ile (0) ae a OE Or=1)" 


(195) 


exp {72 (0, Q(t)" v) —3 (v| R (0) — Q (t) RO) Q ()* | v)}. 


It is seen that the characteristic function C (f,v) satisfies the partial differential 
equation (in coordinate representation, comp. I.) 


Q 
ot 5 Vagee : +L +LV Ve] OC v) =0 (196) 


av, 


which is the Fourier transform of the Fokker-Planck or master- -equation and 
can be used as a natural starting point for an extension of the statistical theory 
of irreversible processes to non linear phenomenological equations. 
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APPENDIX 1 


Radon-Nikodym decomposition of a spectral measure 


Definition [Halmos, 1951]: If Q is a set with a specificd o-field F of subsets, 

a spectral measure in Q is a function #(.) whose domain is F and whose values 

are projections on a Hilbert space H, such that H (Q)=1 and such that E (UF,) 
n 


=> E(F,,) whenever the F, form a disjoint sequence of sets belonging to ZF. 


n 

Let w(.) be a non negative o-finite measure in QQ. 

For any element x in H, ||H(.)2||? is a normalized measure in Q and may 
according to the Radon-Nikodym theorem be uniquely decomposed into a - 
continuous and a w-singular part 


|| Z (dw) «||? =, (w) wu (dw) + pz (do), (Al: 1) 


where g,(.) is mw-integrable and y,(.) vanishes outside a w-nullset N;,. 
We define the operation H* on H by 


EP a=EH (N,) x. (Al: 2) 
The operation £° is linear, for 


ES (a@+y)=E (Nesy) (e+y)=EH(N;zU N,) (x +y)=E(N,U N,) e+ 


(Al: 3) 

+H(N,UN,)y=Eoat Ey. 

The second step of (Al:3) follows from the triangular inequality 
WEF (.)@+y|<2C) + 2O) yl), (Al: 4) 


showing that ||#(.)(«+y)|| vanishes on any p-nullset outside N,U Ny. The 
operator H*° is now directly seen to be a projection which commutes with any 
E (Ff) 

ES EB (F)a=H (N,N F)x=E (F) HE’ x. (Al: 5) 


Defining the spectral measures #*(.) and £*(.) by 
B(Y=LE PF); £ (f= (f)-L f), (Al: 6) 


we have succeeded in decomposing the spectral measure E (.) into a p-continuous 
and a w-singular part. The two parts are orthogonal. If NV is a y-nullset we 


have: 
uu (N)=0 implies £*(N)=0. (Al: 7) 
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If the Hilbert space H*H is separable E*(.) vanishes Ouiae a w-nullset, for 
let {,} be a sequence which is dense in H* H, and take N = U N,,, then uw (N)=C 


Let x€E°H and ||x—<a,||<e, then || B (N) (x— a) y= N)a#—2,||<e and 
\|e—-E(N)a||<2e, that is H(N)2=2 for ae element a in rH and 


E’ = E’ (N)=E(N). (Al: 8) 


APPENDIX 2 


A study of the Wey] relation 


1. Let {U (t)|—co<t<+0c} and {V (t)|—co<t< +c} be two one-para- 
meter groups a unitary transformations on a Hilbert space H, such that for 
arbitrary elements x and y in H,<y|U (t)|#> and <y| V(t t)| a> are continuous 
functions of ¢t. Let the two groups be abies by the bees relation 


U (s) V (t) U(s)" V(t)" = exp {ist}. (A2: 1) 
2. According to Stone’s theorem the spectral decompositions of the two 


groups are 


+00 
U (s)= | exp {ist} dE, (A2: 2) 


—oo 


+ co 


V (s)= i exp {ist}d F,, (A2: 3) 


—o 


with uniquely determined spectral distributions #. and F’. on the real axis. That 
is for arbitrary real ¢, H,, and F; are projection operators on H satisfying 


E;, E:,=L min tty; lim H,=0; lim #,=1; ae Ei, =; 


t+ — 00 t—>-+ 00 
and the corresponding relations for F.. 


3. The spectral distributions #. and F. satisfy the relations 


U (s) F,= Fi, U (s), (A2: 4) 
E, V (s)= V (s) E445. (A2: 5) 

Proof: For arbitrary x and y in H we have, according to (A2:1), 
<y|U (8) V ®|>=<y| V(t) U (s)|a> exp {— ist}. (A2: 6) 


Introducing the spectral decomposition (A2:2) of U(s) in (A2:6) we obtain 
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+o +00 
I exp {isu} d, <y|E, V (t)|2)>= f exp {is (u—t)}d,<y| V(t) E,| 2) 
me (A2:7) 
= i) exp {isu} d, <y| V (t) Hus:|2). 


By application of the inversion formula for the Fourier-Stieltjes transform to 
(A2:7) the equality (A2:4) follows. The equality (A2:5) follows from (A2: 4) and 
the symmetry of (A2:1). 


4. The spectral distributions #. and F. are absolutely continuous with re- 
spect to the Lebesgue measure on the real axis. 


Proof: Let E(.) and F(.) be the spectral measures on the Borel field of R} 
determined by #. and F.. That is if J is the interval (a<t<b) 


BE(I1)=#,—-H, F(UI)=F,—F,. (A2: 8) 
It follows from (4) that if 73 is an arbitrary finite interval 
BI) UG) F Qi) =f 02) F G29) UV © =0 for |t\> OG =a). (A2: 9) 


Hence for an arbitrary element x€H according to (A2: 2) 


J exp {tAt}d,<F (12) x| Z| F (12) x> =0 for |t|>(b-a). (A2: 10) 


-o 


This is possible only if <F(1%2)2|H£,|F 12)2>=||Z,F (12)a||? is absolutely 


continuous 
Is 


| GlePade 


—oo 


2d )A=\| By F (Iz) 0\l. (A2: 11) 


Let E® be the projection corresponding to the singular part of the spectral meas- 
ure H(.) (appendix 1). Equation (A2:11) is equivalent to 


|| #° F (12) a||=0 or BS Pox=E° Fax (A2: 12) 


and this relation holds for any finite interval Jj. The limits of (A2:12) for 
b->+0oc, a>-—oo exist and hence #’x=0 for any x€H so that we have found 
E°=0. From the symmetry of (A2:1) it follows that also #(.) is absolutely 
continuous and the proposition is proved. 


5. The spectral distributions H. and Ff. have no gaps, that is 
t+t' implies #,+ Hy, and #,+ Fr. (A2: 13) 
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Proof: If for some t+t’ E,=E, it follows from (A2: 5) and the monotonic 
character of HE; as a function of ¢t, that E, is independent of ¢. Hence #,=0 
and £,=1 which is a contradiction (see section 3). 


6. Every non zero element x,€H determines a subspace H [2] which is 
separable and invariant under both U(.) and V(.). The linear manifold spanned 
by all (finite) linear combinations of elements of the form 


UV ey, UG eRr) (A2: 14) 


is obviously on account of (A2:1) invariant under U(.) and V(.). Closing this 
manifold we obtain a subspace which is invariant and separable due to the fact 
that U (t) and V(t) are continuous functions of f. 

Hence it is no essential restriction to assume that the Hilbert space H is 
separable when studying the relation (A2: 1). 


7. If the Hilbert space H is separable, and if in Weyl’s relation (A2:1) V (.) 
can be replaced by the group V,(.), then V(.) and V,(.) are connected by 
a unitary transformation W, independent of f, 


V,(t)=W V (t) W" (A2: 15) 


which commutes with all transformations of the group U(.). Conversely, if W 
is a unitary transformation commuting with U(.) the group V )(.) defined by 
(15) will evidently satisfy Weyl’s relation. 


Proof: In order to prove the first statement we shall derive an explicit re- 
presentation of the Hilbert space H and of the groups U(.) and V(.). If g, 
is an arbitrary non zero element in H a subspace, invariant under U (.) is gen- 
erated by all elements which are limits of elements of the form 


2,¢ (t) U (t) gy (A2: 16) 


where c(t’) is an arbitrary complex function which is equal to zero for all but 
a finite number of values of its argument. In general this subspace is a proper 
subspace of H. If so we can find a non zero element gy which is orthogonal 
to the subspace generated by g, and U(.) and a second subspace generated by 
g. and U(.) is obtained. This procedure may be continued in an obvious way 
and as the Hilbert space H was assumed to be separable we obtain finally a 
U (.)-invariant decomposition of H into finitely or denumerably many mutually 
orthogonal subspaces, generated by the elements 


Ji> Jos eee yg Yn» siejei ie H (A2: 17) 


It is possible to choose the generating elements (A2:17) such that for any Borel 
set Ac R? 


E (A)g,=0 if and only if H(A)=0, (A2: 18) 
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for if this condition if not originally satisfied we normalize the elements (A2=17) 
so that 
n=oo 
BS DGns Il 
=1 


n 


= 2 ll gn ||? < 20 (A2: 19) 


is convergent. It follows at once that (A2:18) is satisfied with g, replaced by 
h, for H(A)h=0 implies £(A)g,=0 for every n and hence E (A) «=0 for every 
x in H. We may take h as the first generating element and then continue the 
procedure described above. In the following we assume that the g, (A2:17) have 
been chosen so as to satisfy (A2:18). Denoting by H, the subspace generated 
by g, and U(.), we have for any x€H a unique decomposition 


es SS 2H, (ois) (A2: 20) 


and for each component x, we have (see e.g. [Achieser and Glasman, 1954]) a 
representation 


+ 60 


Ly = J x(n, A)d Li gn, (A2: 21) 


— 0 


where x(n, A) is a complex function, square integrable over Rt with respect to 
the measure d;||#;g,||? such that with y€H 


<e|y>= > <ealynr= > Jam, Ady (m, A)dil|Zign|?. (A: 22) 
n=1 n=1 _¢ 


The spectral distribution #. is absolutely continuous so that we can write 
d,||Eign | =|f (nm, a) |? aa. (A2: 23) 
As the spectral distribution HZ. has no gaps and as g, satisfies (A2:18) we have 
f(n, A4)+0 (for almost every A). (A2: 24) 
We define for any ~€H 
<n, 2|x>=x (n, A) f (n, A); <n, Alay =<a|n, 2. (A2: 25) 


It follows from (A2: 22) that 


elyy= "5 | Gln, ayaace, Alyy. (A2: 26) 
n=1 


For every fixed n (A2:25) represents in view of (A2: 24) an isometric biunique 
correspondence between the subspace H,, of H and the Hilbert space L*(— 00, + 00) 
of functions square integrable over the real axis with respect to the Lebesgue 


measure d J. 
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As E. is the spectral distribution corresponding to U (.) (A2:2) we have 
<n, A|U (t)|x> = exp {i At} <n, A|x> (for almost every A) (A2: 27) 


or symbolically 
t)|n, A> = exp {iAt}|n, AD. (A2: 28) 


We note that <n, A| may be considered as a mapping of H, onto L?(— 0, 0) 
or as a generalized eigenvector of U (¢) in the sense of [Dirac, 1947] with the 
symbolic orthogonalization condition 


<n, An’, A’) =Sn.ny 0 (4-2), (A2: 29) 


which has the proper meaning (A2:26). The mapping (A2:25) is determined by 
the isometry condition (A2:26) up to the phase of 


fn, Aan, A192 (A2: 30) 


We are now going to determine the <n, A|-representative of V (¢t)x for an 
arbitrary «€H, <n, A|V(t)|2>. We start by representing V (t)g, in the form 
(A2: 20), (A2: 21) 


+00 


V(t) 9n= | v(A, A+t; m, n)d Ly gm (A2: 31) 


where v(A, A+t; m,n) is a complex function, square integrable with respect to 
d, || By Im ||" for feed m, and such that 
+00 
m=co 


pS |v (A, A+¢; m,n) |? dy || Bi gm||?< + o0. 
m=1 


The representative of V(t)” is now obtained by use of the commutation rela- 
tion (A2:5); with y arbitrary 


+00 


<y|V (t)| “>= 2. | a (m, A) da <y| V (t) Bal gm> 


=) | x(m, A+ t) da; <y| Ey V (t)|9m> (A2: 32) 


+00 


=a): x(m,A+t) Se (A, A+t; n, m) d,<y| Bal gn>. 


—o 
The order of summation may be interchanged and as y is arbitrary we find 
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ni, A\V@le>= Dd vA A+E vn, Mi) <M, Ato (A2: 33) 
m=1 


In order to obtain the representative of V (t)*x we consider 


+00 


} dacy 


S100) 


Ky | Vitey |ao= CV (t)y|az>= > > AA tomy Attn) m) Gh, Alay, 


=e ek [a2 Gln, meat Aa; m, n) <m, A—t|2, 


m=1n=1 
=e) | diac<y|n, Ad <n, A| V (t)* |x). 
M1 
Hence <n, A| V (t)* |2> =v(A—t, 4; m, n) <m, A-t| 2). (A2: 34) 


and as V (t)*=V(-—t) 


v(A, A—t; n, m)=v(A—t; A; m, n). (A2: 35) 
The group property of V (.) 
VQVO)HVEH) . (A2: 36) 


gives rise to the following relation for the representative v (A, A’; m,m’) 


m'=c0 


> v(A, 3m, m)v(N, 2°; mm’, m”)=v(A, A"; m, m”) (A2: 37) 


m’=1 
as is easily verified. Denoting by v(A, 2’) the matrix 
{vy (A, 1; m, n)|m, n=1, 2, ...} 


we can express the group property (A2: 37) and the unitarity (A2: 35) in the form 
v(A, A) v (a, A HVA, A"); VA, A=, 


(A2: 38) 
Vs) =vd,A=viA, 2). 
With /, an arbitrary fixed real number we can write 
UAL ARV AS AY Ved): (A2;: 39) 
Further we may define a unitary transformation W on H by 
<n, 4|W|2>= La v (Ag, A; n,m) <m, Al a>. (A2: 40) 
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It is evident that W and hence W* commute with all transformations of the 
group U(.). Further according to the construction 


<n, A| WV (t) W*|2> =<n, A+t| 2), (A2: 41) 
showing that W V(t) W* is represented by the group of translations, when U (f) 
is represented by the multiplicative operation (A2:27). Otherwise expressed: by 
a proper choice of the generating elements (A2:17), U(.) is represented by the 
operation (A2:27) and V(.)* by the group of translations. We have only to 
replace the generating elements 9,, go, ..., by W*g,, W* gg, ... in order to obtain 
the desired representation. 
To make the proof of the proposition 7 complete, let V)(.) satisfy Weyl’s 
relation (A2:1). There will then exist a unitary W, such that 
W, V,(t) Wo =W V (t) W* 
is represented by the group of translations, and hence 
V, (t)= Wo WV (t) (Wo W)* 
as was to be proved. 
8. The groups U(.) and V(.) are unitarily equivalent 
UOQ=SV OS, Acosta os) a 5 (A2: 42) 
Proof: As by a proper choice of generating elements we have 


<n, A|U (t)| > = exp {i At} <n, A|a> and <n, A| V (t)|2>=<n, At+t|x 


we can define S as the Fourier-Plancherel transformation (section 9) 


+00 
1 
<n, 4|8| z= —— | exp { —tAt} dé <n, t|z). (A2: 43) 


9. A natural generalization of Weyl’s relation which has applications in physics 
as well as in the theory of stochastic processes is obtained by considering a 
representation of a locally compact, commutative group G (see e.g. [Pontrjagin, 
1946]) by unitary transformations {U (t)|£€G@} in a Hilbert space H such that 
<x|U (t)|v> is a continuous function of the group element ¢. A character ¢ (aa) 
of the group G@ is by definition a complex, continuous function on G@ satisfying 


é(t,)e (f.)=e(t,t,.)  |e(t)|=1. (A2: 44) 


The characters of G form the commutative character group G@,, with the com- 
position 


€, (t) e, (t) =e, e, (£). (A2: 45) 
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With G the character group G, is locally compact. The relation between G and 
G, is symmetric. Let {V (e)|e€G.} be a representation of the character group 
by unitary transformations which are continuous functions of their argument, 
and let the groups U(.) and V'(.) be connected by the generalized Weyl relation 


Vitel Gy Pier a) =e Gy. (A2: 46) 


It can be shown, though we shall not give the proofs, that most of the prop- 
ositions about the Weyl relation (A2:1) discussed above remain true in a gen- 
eralized sense for the relation (A2:46). In particular there are essentially two 
representations of the relation (A2:46), one in terms of the Hilbert space L? (@) 
of complex functions on G, square integrable with respect to the invariant (Haar) 
measure (.), and one in terms of the Hilbert space L*(G,) corresponding to 
the character group. Hence we will obtain a decomposition of the Hilbert space 
H into orthogonal subspaces each invariant under U(.) and V(.) and isomet- 
rically equivalent to L*(G) and to L*(G,). In fact if x is an element of one 
such subspace and if its L*(G@)-representative is <t; G|x>, its L* (G,)-representa- 
tive <e; G.|~2> we have the generalized Fourier transformation 


<0; Gla = | e (t) <t; G|x> u (a6), (A2: 47) 
G 
<t; Gla>= [ e(t) <e; Go| a> pe (de), (A2: 48) 
Ge 
lle? = [|< Ela> Pw (dt) = [| <e; Go| a>]? me (AE). (A2: 49) 
G Ge 


The equation (A2:49) is the general form of Parseval’s relation. Further the 
representations of U(.) and V(.) are given by 


<t; G| U (t,)|2> =<t’ & G|x), (A2: 50) 
dt; G| V (e)| 2 =e (t) <; G| a, (A2: 51) 


and hence by application of (A2:47), (A2: 48) 
ge; G,| U0) a> =e (t,) <6 G.|2>, (A2: 52) 
<e; G,| V (e,)| a> = <e,e; Ge| >. (A2 53) 


Possible and important examples of (@ are: the group of translations ma 
finite-dimensional Euclidean space, corresponding to a finite set of canonical co- 
ordinates in quantum mechanics, the group of integers with addition as com- 
position, the group of translations mod 27 etc. 
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